\ SEQUENCE LISTING 

(1) GENERAL ^INFORMATION: 

(i) APPLICANT: Mathews, Lawrence S. 
\ Vale, Wylie W. 

\ Tsuchida, Kunihiro 

(ii) TITLE ©F INVENTION: CLONING AND RECOMBINANT PRODUCTION OF 
RECEPTOR (S) OF THE ACTIVIN/TGF-BETA SUPERFAMILY 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pretty, Schroeder, Brueggemann & Clark 

(B) STREET: 444 South Flower Street, Suite 2000 

(C) CITY: lLos Angeles 

(D) STATE :\CA 

(E) COUNTRY: USA 

(F) ZIP: 90071 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TfflPE: Floppy disk 

(B) COMPUTER : \ IBM PC compatible 

(C) OPERATING ^SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION WjMBER: 

(B) FILING DATE ; \ 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : US 08/300,584 

(B) FILING DATE: Q2YSEP-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/880,220 

(B) FILING DATE: 08-MAY-1992 

(vii) PRIOR APPLICATION DATA : \ 

(A) APPLICATION NUMBER A US 07/773,229 

(B) FILING DATE: 09-OCT-U991 

(vii) PRIOR APPLICATION DATA: \ 

(A) APPLICATION NUMBER: vk 07/698 , 709 

(B) FILING DATE: 10 -MAY- 1991 

(viii) ATTORNEY/ AGENT INFORMATION: \ 

(A) NAME: Reiter, Stephen E.\ 

(B) REGISTRATION NUMBER: 31,i92 

(C) REFERENCE/DOCKET NUMBER: P41 9927 

(ix) TELECOMMUNICATION INFORMATION; \ 

(A) TELEPHONE; 619-546-4737 \ 

(B) TELEFAX: 619-546-9392 \ 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 71.. 1609 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CTCCGAGGAA GACCCAGGGA ACTGGATATC TAGCGAGAAC TTCCTACGGC TTCTCCGGCG 60 

CCTCGGGAAA ATG GGA GCT GCT GCA AAG TTG GCG TTC GCC GTC TTT CTT 109 
Met Gly Ala Ala Ala Lys Leu Ala Phe Ala Val Phe Leu 
15 10 

ATC TCT TGC TCT TCA GGT GCT ATA CTT GGC AGA TCA GAA ACT CAG GAG 157 
lie Ser Cys Ser Ser Gly Ala lie Leu Gly Arg Ser Glu Thr Gin Glu 
15 20 25 

TGT CTT TTC TTT AAT GCT AAT TGG GAA AGA GAC AGA ACC AAC CAG ACT 205 
Cys Leu Phe Phe Asn Ala Asn Trp Glu Arg Asp Arg Thr Asn Gin Thr 
30 35 40 45 

GGT GTT GAA CCT TGC TAT GGT GAT AAA GAT AAA CGG CGA CAT TGT TTT 253 

13 G1 Y Val Glu Pro Cys Tyr Gly Asp Lys Asp Lys Arg Arg His Cys Phe 

>U 50 55 60 

v.. I 

^ GCT ACC TGG AAG AAT ATT TCT GGT TCC ATT GAA ATA GTG AAG CAA GGT 301 

:?! Ala Th *" Trp Lys Asn lie Ser Gly Ser lie Glu lie Val Lys Gin Gly 

RJ 65 70 75 



TGT TGG CTG GAT GAT ATC AAC TGC TAT GAC AGG ACT GAT TGT ATA GAA 34 9 

Cys Trp Leu Asp Asp lie Asn Cys Tyr Asp Arg Thr Asp Cys lie Glu 
80 85 90 

AAA AAA GAC AGC CCT GAA GTG TAC TTT TGT TGC TGT GAG GGC AAT ATG 397 
Lys Lys Asp Ser Pro Glu Val Tyr Phe Cys Cys Cys Glu Gly Asn Met 
95 100 105 

TGT AAT GAA AAG TTC TCT TAT TTT CCG GAG ATG GAA GTC ACA CAG CCC 44 5 

Cys Asn Glu Lys Phe Ser Tyr Phe Pro Glu Met Glu Val Thr Gin Pro 
110 115 120 125 

ACT TCA AAT CCT GTT ACA CCG AAG CCA CCC TAT TAC AAC ATT CTG CTG 493 
Thr Ser Asn Pro Val Thr Pro Lys Pro Pro Tyr Tyr Asn lie Leu Leu 
130 135 140 

TAT TCC TTG GTA CCA CTA ATG TTA ATT GCA GGA ATT GTC ATT TGT GCA 541 
Tyr Ser Leu Val Pro Leu Met Leu lie Ala Gly lie Val lie Cys Ala 
145 150 155 

TTT TGG GTG TAC AGA CAT CAC AAG ATG GCC TAC CCT CCT GTA CTT GTT 589 
Phe Trp Val Tyr Arg His His Lys Met Ala Tyr Pro Pro Val Leu Val 
160 165 170 

CCT ACT CAA GAC CCA GGA CCA CCC CCA CCT TCC CCA TTA CTA GGG TTG 637 
Pro Thr Gin Asp Pro Gly Pro Pro Pro Pro Ser Pro Leu Leu Gly Leu 
175 180 185 

AAG CCA TTG CAG CTG TTA GAA GTG AAA GCA AGG GGA AGA TTT GGT TGT 685 
Lys Pro Leu Gin Leu Leu Glu Val Lys Ala Arg Gly Arg Phe Gly Cys 

195 200 205 

GTC TGG AAA GCC CAG TTG CTC AAT GAA TAT GTG GCT GTC AAA ATA TTT 733 
Val Trp Lys Ala Gin Leu Leu Asn Glu Tyr Val Ala Val Lys He Phe 
210 215 220 



Q 



o 
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CCA ATA CAG GAC AAA CAG TCC TGG CAG AAT GAA TAT GAA GTC TAT AGT 781 
Pro lie Gin Asp Lys Gin Ser Trp Gin Asn Glu Tyr Glu Val Tyr Ser 
225 230 235 

CTA CCT GGA ATG AAG CAT GAG AAC ATA CTA CAG TTC ATT GGT GCA GAG 829 
Leu Pro Gly Met Lys His Glu Asn lie Leu Gin Phe lie Gly Ala Glu 
240 245 250 

AAA AGA GGC ACC AGT GTG GAT GTG GAC CTG TGG CTA ATC ACA GCA TTT 877 
Lys Arg Gly Thr Ser Val Asp Val Asp Leu Trp Leu lie Thr Ala Phe 
255 260 265 

CAT GAA AAG GGC TCA CTG TCA GAC TTT CTT AAG GCT AAT GTG GTC TCT 92 5 

His Glu Lys Gly Ser Leu Ser Asp Phe Leu Lys Ala Asn Val Val Ser 
270 275 280 285 

TGG AAT GAA CTT TGT CAT ATT GCA GAA ACC ATG GCT AGA GGA TTG GCA 973 
Trp Asn Glu Leu Cys His lie Ala Glu Thr Met Ala Arg Gly Leu Ala 
290 295 300 

TAT TTA CAT GAG GAT ATA CCT GGC TTA AAA GAT GGC CAC AAG CCT GCA 1021 
Tyr Leu His Glu Asp lie Pro Gly Leu Lys Asp Gly His Lys Pro Ala 
305 310 315 

ATC TCT CAC AGG GAC ATC AAA AGT AAA AAT GTG CTG TTG AAA AAC AAT 1069 
lie Ser His Arg Asp lie Lys Ser Lys Asn Val Leu Leu Lys Asn Asn 
320 325 330 



flj CTG ACA GCT TGC ATT GCT GAC TTT GGG TTG GCC TTA AAG TTC GAG GCT 1117 

m Leu Thr Ala Cys lie Ala Asp Phe Gly Leu Ala Leu Lys Phe Glu Ala 

335 340 345 



GGC AAG TCT GCA GGT GAC ACC CAT GGG CAG GTT GGT ACC CGG AGG TAT 1165 
Gly Lys Ser Ala Gly Asp Thr His Gly Gin Val Gly Thr Arg Arg Tyr 
350 355 360 365 

ATG GCT CCA GAG GTG TTG GAG GGT GCT ATA AAC TTC CAA AGG GAC GCA 1213 
Met Ala Pro Glu Val Leu Glu Gly Ala lie Asn Phe Gin Arg Asp Ala 
370 375 380 

TTT CTG AGG ATA GAT ATG TAC GCC ATG GGA TTA GTC CTA TGG GAA TTG 1261 
Phe Leu Arg lie Asp Met Tyr Ala Met Gly Leu Val Leu Trp Glu Leu 
385 390 395 

GCT TCT CGT TGC ACT GCT GCA GAT GGA CCC GTA GAT GAG TAC ATG TTA 1309 
Ala Ser Arg Cys Thr Ala Ala Asp Gly Pro Val Asp Glu Tyr Met Leu 
400 405 410 

CCA TTT GAG GAA GAA ATT GGC CAG CAT CCA TCT CTT GAA GAT ATG CAG 1357 
Pro Phe Glu Glu Glu lie Gly Gin His Pro Ser Leu Glu Asp Met Gin 
415 420 425 

GAA GTT GTT GTG CAT AAA AAA AAG AGG CCT GTT TTA AGA GAT TAT TGG 1405 
Glu Val Val Val His Lys Lys Lys Arg Pro Val Leu Arg Asp Tyr Trp 
430 435 440 445 

CAG AAA CAT GCA GGA ATG GCA ATG CTC TGT GAA ACG ATA GAA GAA TGT 1453 
Gin Lys His Ala Gly Met Ala Met Leu Cys Glu Thr lie Glu Glu Cys 
450 455 460 

TGG GAT CAT GAT GCA GAA GCC AGG TTA TCA GCT GGA TGT GTA GGT GAA 1501 
Trp Asp His Asp Ala Glu Ala Arg Leu Ser Ala Gly Cys Val Gly Glu 
465 470 475 

AGA ATT ACT CAG ATG CAA AGA CTA ACA AAT ATC ATT ACT ACA GAG GAC 1549 
Arg lie Thr Gin Met Gin Arg Leu Thr Asn lie lie Thr Thr Glu Asp 
480 485 490 



# 




13 
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w 
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ATT GTA ACA GTG GTC ACA ATG GTG ACA AAT GTT GAC TTT CCT CCC AAA 1597 
lie Val Thr Val Val Thr Met Val Thr Asn Val Asp Phe Pro Pro Lys 
495 500 505 

GAA TCT AGT CTA TGATGGTGGC ACCGTCTGTA CACACTGAGG ACTGGGACTC 164 9 

Glu Ser Ser Leu 

510 



TGAACTGGAG 


CTGCTAAGCT 


7V TV /"'/"I TV TV TV ^^TV** 

AAGGAAAGTG 


LH AG 1 x G A X 


X X ILlbiulb 


a a aTPaPTap 
AAA X bAb X Avj 


1 *7ftQ 


GATGCCTCCA 


tv p a tvt ta p 
GGAC ATG I AG 


G LAAuuAoL G 


ppTTTSTYsn a a 

V— V*. X X Aj X VJUAH 


A f2 P A TCifi 71 TP 


X VJUVlnUn X VJVJ 


176 9 


ATCTGGGAAA 


CTTACTGCA 1 


GG 1C1 vjV_AvjI~ 


ALAbA i Ai Vj/A. 


ars & an a otpt^ 


a annnzi a a an 


182 9 


CTGCAAACTG 


rp TV TV TV rf" 1 TV TV pnwp 

TAAAbAAL I I 




7X PTPfi 7i Hd 7X A 


1\J i vjVjv_ v_v„ x v_ 


TPPA A ATPA A 


1889 


GGATCTTTT G 


TV f^^*TV2/* , ^ , *T , TV 

IjALL lAiuL 1 A 


A X X A X X 


•rnpaaa APTY2 


APATPAOATT 


tptt a a*ty3*pp 

IVrl 1 rin X U X V- 


1949 


m/if|in TV /"* TV TV/™* TV 

TGTCAGAAGA 


/~i TV /"WP7V A r P r Pf , /" t 

UiL 1AAJ. X V-t- 


nwnii tv Tv TY2TV a r" 1 


X >VV~ X VJV_ X J-\. X X 


TTTTTTIi a at 

X X X X X X nrirt X 


ClAAAAAPTTT 

U/vUvlnV' XXX 


2009 


TV i|H|MH/i TV TV 


rrvTwprpTV TV TV TV TV 


VjVJ X X X X X 


X /A. X X X X X 


fip*PGT*i , fi r r i T* r r 

VJ\. X VJ X X \J XXX 


CTATAAATGA 


2069 


CTATTGTAAT 


GCCAACATGA 


CACAGCTTGT 


GAATGTGTAG 


TGTGCTGCTG 


TTCTGTGTAC 


2129 


ATAGTCATCA 


AAGTGGGGTA 


CAGTAAAGAG 


GCTTCCAAGC 


ATTACTTTAA 


CCTCCCTCAA 


2189 


CAAGGTATAC 


CTCAGTTCCA 


CGGTTGTTAA 


ATTATAAAAT 


TGAAAACACT 


AACAGAATTT 


2249 


GAATAAATCA 


GTCCATGTTT 


TATAACAAGG 


TTAATTACAA 


ATTCACTGTG 


TTATTTAAGA 


2309 


AAAAATGGTA 


AGCTATGCTT 


AGTGCCAATA 


GTAAGTGGCT 


ATTTGTAAAG 


CAGTGTTTTA 


2369 


GCTTTTCTTC 


TACTGGCTTG 


TAATTTAGGG 


AAAACAAGTG 


CTGTCTTTGA 


AATGGAAAAG 


2429 


AATATGGTGT 


CACCCTACCC 


CCCATACTTA 


TATCAAGGTC 


CCAAAATATT 


CTTTTCCATT 


2489 


TCAAAGACAG 


CACTTTGAAA 


ACCCTAAATT 


ACAAGCCAGT 


AGAAGAAAAG 


CTAAAACACG 


2549 


CTTTACAAAT 


AGCC 










2563 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Ala Ala Ala Lys Leu Ala Phe Ala Val Phe Leu lie Ser Cys 
15 10 15 

Ser Ser Gly Ala lie Leu Gly Arg Ser Glu Thr Gin Glu Cys Leu Phe 
20 25 30 

Phe Asn Ala Asn Trp Glu Arg Asp Arg Thr Asn Gin Thr Gly Val Glu 
35 40 45 

Pro Cys Tyr Gly Asp Lys Asp Lys Arg Arg His Cys Phe Ala Thr Trp 
50 55 60 

Lys Asn lie Ser Gly Ser lie Glu lie Val Lys Gin Gly Cys Trp Leu 
65 70 75 80 



Asp Asp lie Asn Cys Tyr Asp Arg Thr Asp Cys lie Glu Lys Lys Asp 
85 90 95 

Ser Pro Glu Val Tyr Phe Cys Cys Cys Glu Gly Asn Met Cys Asn Glu 
100 105 110 

Lys Phe Ser Tyr Phe Pro Glu Met Glu Val Thr Gin Pro Thr Ser Asn 
115 120 125 

Pro Val Thr Pro Lys Pro Pro Tyr Tyr Asn lie Leu Leu Tyr Ser Leu 
130 135 140 

Val Pro Leu Met Leu lie Ala Gly lie Val lie Cys Ala Phe Trp Val 
145 150 155 160 

Tyr Arg His His Lys Met Ala Tyr Pro Pro Val Leu Val Pro Thr Gin 
165 170 175 

Asp Pro Gly Pro Pro Pro Pro Ser Pro Leu Leu Gly Leu Lys Pro Leu 
180 185 190 

Gin Leu Leu Glu Val Lys Ala Arg Gly Arg Phe Gly Cys Val Trp Lys 
195 200 205 

Ala Gin Leu Leu Asn Glu Tyr Val Ala Val Lys lie Phe Pro lie Gin 
210 215 220 

Asp Lys Gin Ser Trp Gin Asn Glu Tyr Glu Val Tyr Ser Leu Pro Gly 
225 230 235 240 

Met Lys His Glu Asn lie Leu Gin Phe lie Gly Ala Glu Lys Arg Gly 
245 250 255 

Thr Ser Val Asp Val Asp Leu Trp Leu lie Thr Ala Phe His Glu Lys 
260 265 270 

Gly Ser Leu Ser Asp Phe Leu Lys Ala Asn Val Val Ser Trp Asn Glu 
275 280 285 

Leu Cys His lie Ala Glu Thr Met Ala Arg Gly Leu Ala Tyr Leu His 
290 295 300 

Glu Asp lie Pro Gly Leu Lys Asp Gly His Lys Pro Ala lie Ser His 
305 310 315 320 

Arg Asp lie Lys Ser Lys Asn Val Leu Leu Lys Asn Asn Leu Thr Ala 
325 330 335 

Cys lie Ala Asp Phe Gly Leu Ala Leu Lys Phe Glu Ala Gly Lys Ser 
340 345 350 

Ala Gly Asp Thr His Gly Gin Val Gly Thr Arg Arg Tyr Met Ala Pro 
355 360 365 

Glu Val Leu Glu Gly Ala lie Asn Phe Gin Arg Asp Ala Phe Leu Arg 
370 375 380 

lie Asp Met Tyr Ala Met Gly Leu Val Leu Trp Glu Leu Ala Ser Arg 
385 390 395 400 

Cys Thr Ala Ala Asp Gly Pro Val Asp Glu Tyr Met Leu Pro Phe Glu 
405 410 415 

Glu Glu lie Gly Gin His Pro Ser Leu Glu Asp Met Gin Glu Val Val 
420 425 430 



Val His Lys Lys Lys Arg Pro Val 
435 440 

Ala Gly Met Ala Met Leu Cys Glu 
450 455 

Asp Ala Glu Ala Arg Leu Ser Ala 
465 470 

Gin Met Gin Arg Leu Thr Asn lie 
485 

Val Val Thr Met Val Thr Asn Val 
500 

Leu 



Leu Arg Asp Tyr Trp Gin Lys His 
445 

Thr lie Glu Glu Cys Trp Asp His 
460 

Gly Cys Val Gly Glu Arg lie Thr 
475 480 

lie Thr Thr Glu Asp lie Val Thr 
490 495 

Asp Phe Pro Pro Lys Glu Ser Ser 
505 510 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: XACTR 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 468.. 1997 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCGCCCACAC 


AGTGCAGTGA 


ATAATAGCCG 


GTGCGGCCCC 


TCCCCTCTTT 


CCCTGGCAGT 


TGTGTATCTG 


TCACATTGAA 


GTTTGGGCTC 


CTGTGAGTCT 


GAGCCTCCCC 


CTGTGTCTCA 


TGTGAAGCTG 


CTGCTGCAGA 


AGGTGGAGTC 


GTTGCATGAG 


GGTGGGGGGA 


GTCGCTGCTG 


TTTGATCTGC 


CTCTGCTCCC 


CATTCACACT 


CTCATTTCAT 


TCCCACGGAT 


CCACATTACA 


ACTCGCCTTT 


AACCCTTTCC 


CTGGCGGAGC 


CCACGCGTCT 


TTCATCCCTC 


CTGCCGCGGC 


CGCTGAGCGA 


CCAGAGCGCG 


ACATTGTTGC 


GGCGGGGGAT 


TGGGCGACAT 


TGTTGCGAAT 


AATCGGAGCT 


GCTGGGGGGG 


AACTGATACA 


ACGTTGCGAC 


TGTAAAGGAA 


TTAACTCGGC 


CGAATGGGAT 


TTTATCTGTG 


TCGGTGAGAG 


AAGCGGATCC 


CAGGAGC ATG GGG GCG 



Met Gly Ala 
1 



TCT GTA GCG CTG ACT TTT CTA CTT CTT CTT GCA ACT TTC CGC GCA GGC 
Ser Val Ala Leu Thr Phe Leu Leu Leu Leu Ala Thr Phe Arg Ala Gly 
5 10 15 



TCA GGA CAC GAT GAA GTG GAG ACA AGA GAG TGC ATC TAT TAC AAT GCC 
Ser Gly His Asp Glu Val Glu Thr Arg Glu Cys lie Tyr Tyr Asn Ala 
20 25 30 35 
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AAC TGG GAA CTG GAG AAG ACC AAC CAA AGT GGG GTG GAA AGC TGC GAA 
Asn Trp Glu Leu Glu Lys Thr Asn Gin Ser Gly Val Glu Ser Cys Glu 
40 45 50 



620 



GGG GAA AAG GAC AAG CGA CTC CAC TGT TAC GCG TCT TGG AGG AAC AAT 
Gly Glu Lys Asp Lys Arg Leu His Cys Tyr Ala Ser Trp Arg Asn Asn 
55 60 65 



666 



TCG GGC TTC ATA GAG CTG GTG AAA AAA GGA TGC TGG CTG GAT GAC TTC 
Ser Gly Phe lie Glu Leu Val Lys Lys Gly Cys Trp Leu Asp Asp Phe 
70 75 80 



716 



AAC TGT TAT GAC AGA CAG GAA TGT ATT GCC AAG GAA GAA AAC CCC CAA 
Asn Cys Tyr Asp Arg Gin Glu Cys lie Ala Lys Glu Glu Asn Pro Gin 
85 90 95 



764 



GTC TTT TTC TGC TGC TGC GAG GGA AAC TAC TGC AAC AAG AAA TTT ACT 
Val Phe Phe Cys Cys Cys Glu Gly Asn Tyr Cys Asn Lys Lys Phe Thr 
100 105 110 115 



812 



CAT TTG CCT GAA GTC GAA ACA TTT GAT CCG AAG CCC CAG CCG TCA GCC 860 
His Leu Pro Glu Val Glu Thr Phe Asp Pro Lys Pro Gin Pro Ser Ala 
120 125 130 

TCC GTA CTG AAC ATT CTG ATC TAT TCC CTG CTT CCA ATT GTT GGT CTT 908 
Ser Val Leu Asn lie Leu lie Tyr Ser Leu Leu Pro He Val Gly Leu 
135 140 145 



TCC ATG GCA ATT CTC CTG GCG TTC TGG ATG TAC CGT CAT CGA AAG CCT 
Ser Met Ala He Leu Leu Ala Phe Trp Met Tyr Arg His Arg Lys Pro 
150 155 160 



956 



v. 

I y 



CCC TAC GGG CAT GTA GAG ATC AAT GAG GAC CCC GGT CTG CCC CCT CCA 1004 
Pro Tyr Gly His Val Glu He Asn Glu Asp Pro Gly Leu Pro Pro Pro 
165 170 175 

TCT CCT CTG GTC GGG CTG AAG CCG CTG CAG TTG CTG GAG ATA AAG GCG 1052 
Ser Pro Leu Val Gly Leu Lys Pro Leu Gin Leu Leu Glu He Lys Ala 
180 185 190 195 

CGA GGC CGT TTC GGT TGC GTC TGG AAA GCT CGT CTG CTG AAT GAA TAT 1100 
Arg Gly Arg Phe Gly Cys Val Trp Lys Ala Arg Leu Leu Asn Glu Tyr 
200 205 210 



GTC GCA GTG AAA ATC TTC CCC GTG CAG GAT AAG CAG TCG TGG CAG TGT 
Val Ala Val Lys He Phe Pro Val Gin Asp Lys Gin Ser Trp Gin Cys 
215 220 225 



1148 



GAG AAA GAG ATC TTC ACC ACG CCG GGC ATG AAA CAT GAA AAC CTA TTG 
Glu Lys Glu He Phe Thr Thr Pro Gly Met Lys His Glu Asn Leu Leu 
230 235 240 



1196 



GAG TTC ATT GCC GCT GAG AAG AGG GGA AGC AAC CTG GAG ATG GAG CTG 
Glu Phe He Ala Ala Glu Lys Arg Gly Ser Asn Leu Glu Met Glu Leu 
245 250 255 



1244 



TGG CTC ATC ACT GCA TTT CAT GAT AAG GGT TCT CTG ACG GAC TAC CTG 
Trp Leu lie Thr Ala Phe His Asp Lys Gly Ser Leu Thr Asp Tyr Leu 
260 265 270 275 



1292 



AAA GGG AAC TTG GTG AGC TGG AAT GAA CTG TGT CAC ATA ACA GAA ACA 
Lys Gly Asn Leu Val Ser Trp Asn Glu Leu Cys His He Thr Glu Thr 
280 285 290 



1340 



ATG GCT CGT GGG CTG GCC TAC TTA CAT GAA GAT GTG CCC CGC TGT AAA 
Met Ala Arg Gly Leu Ala Tyr Leu His Glu Asp Val Pro Arg Cys Lys 
295 300 305 



1388 



□ 



r r "' 
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GQT GAA GGG CAC AAA CCT OCA ATC GCT CAC AGA GAT TTT AAA AGT AAG 1436 

Gly Glu Gly His Lys Pro Ala lie Ala His Arg Asp Phe Lys Ser Lys 
310 315 320 

AAT GTA TTG CTA AGA AAC GAC CTG ACT GCG ATA TTA GCA GAC TTC GGG 1484 
Asn Val Leu Leu Arg Asn Asp Leu Thr Ala lie Leu Ala Asp Phe Gly 
325 330 335 

CTG GCC GTA CGA TTT GAG CCT GGA AAA CCT CCG GGA GAT ACA CAC GGG 1532 
Leu Ala Val Arg Phe Glu Pro Gly Lys Pro Pro Gly Asp Thr His Gly 
340 345 350 355 

CAG GTT GGC ACC AGG AGG TAT ATG GCT CCT GAG GTT CTA GAG GGA GCA 1580 
Gin Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu Glu Gly Ala 
360 365 370 

ATT AAC TTT CAG CGA GAT TCC TTT CTC AGG ATA GAT ATG TAT GCC ATG 1628 
lie Asn Phe Gin Arg Asp Ser Phe Leu Arg lie Asp Met Tyr Ala Met 
375 380 385 

GGA CTG GTA CTC TGG GAA ATA GTA TCC CGA TGT ACA GCA GCA GAT GGG 1676 
Gly Leu Val Leu Trp Glu lie Val Ser Arg Cys Thr Ala Ala Asp Gly 
390 395 400 

CCA GTA GAT GAG TAT CTG CTC CCA TTC GAA GAA GAG ATT GGG CAA CAT 1724 
Pro Val Asp Glu Tyr Leu Leu Pro Phe Glu Glu Glu lie Gly Gin His 
405 410 415 

CCT TCC CTA GAG GAT CTG CAA GAA GTT GTC GTT CAC AAG AAG ATA CGC 1772 
Pro Ser Leu Glu Asp Leu Gin Glu Val Val Val His Lys Lys lie Arg 
420 425 430 435 

CCT GTA TTC AAA GAC CAC TGG CTG AAA CAC CCT GGT CTG GCC CAA CTG 1820 
Pro Val Phe Lys Asp His Trp Leu Lys His Pro Gly Leu Ala Glh Leu 
440 445 450 

TGC GTC ACC ATT GAA GAA TGC TGG GAC CAT GAT GCG GAA GCA CGG CTT 1868 
Cys Val Thr lie Glu Glu Cys Trp Asp His Asp Ala Glu Ala Arg Leu 
455 460 465 



^5 TCG GCA GGC TGC GTA GAG GAG CGT ATT TCC CAA ATC CGT AAA TCA GTG 1916 

0 Ser Ala Gly Cys Val Glu Glu Arg lie Ser Gin lie Arg Lys Ser Val 

470 475 480 

AAC GGC ACT ACC TCG GAC TGC CTT GTA TCC ATT GTT ACA TCT GTC ACC 1964 
Asn Gly Thr Thr Ser Asp Cys Leu Val Ser lie Val Thr Ser Val Thr 
485 490 495 

AAT GTG GAC TTG CCG CCC AAA GAG TCC AGT ATC TGAGGTTTCT TTGGTCTTTC 2 017 

Asn Val Asp Leu Pro Pro Lys Glu Ser Ser lie 



500 


505 


510 








CAGACTCAGT 


GACTTTTAAA 


AAAAAAACTC 


ACGAATGCAG 


CTGCTATTTT 


ATCTTGACTT 


2077 


TTTAATATTT 


TTTTTCTTGG 


ATTTTACTTG 


GATCGGATCA 


ATTTACCAGC 


ACGTCATTCG 


2137 


AAAGTATTAA 


AAAAAAAAAA 


CAAAACAAAA 


AAGCAAAAAC 


AGACATCTCA 


GCAAGCATTC 


2197 


AGGTGCCGAC 


TTATGAATGC 


CAATAGGTGC 


AGGAACTTCA 


GAACCTCAAC 


AAACTCATTT 


2257 


CTAGAGAATG 


TTCTCCTGGT 


TTCCTTTATC 


TCAGAAGAGG 


ACCCATAGGA 


AAACACCTAA 


2317 


GTCAAGCAAA 


TGCTGCAG 










2335 
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(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Ser Val Ala Leu Thr Phe Leu Leu Leu Leu Ala Thr Phe 
15 10 15 

Arg Ala Gly Ser Gly His Asp Glu Val Glu Thr Arg Glu Cys lie Tyr 
20 25 30 

Tyr Asn Ala Asn Trp Glu Leu Glu Lys Thr Asn Gin Ser Gly Val Glu 
35 40 45 

Ser Cys Glu Gly Glu Lys Asp Lys Arg Leu His Cys Tyr Ala Ser Trp 
50 55 60 

Arg Asn Asn Ser Gly Phe lie Glu Leu Val Lys Lys Gly Cys Trp Leu 
65 70 75 80 

Asp Asp Phe Asn Cys Tyr Asp Arg Gin Glu Cys lie Ala Lys Glu Glu 
85 90 95 

Asn Pro Gin Val Phe Phe Cys Cys Cys Glu Gly Asn Tyr Cys Asn Lys 
100 105 110 

Lys Phe Thr His Leu Pro Glu Val Glu Thr Phe Asp Pro Lys Pro Gin 
115 120 125 

Pro Ser Ala Ser Val Leu Asn lie Leu lie Tyr Ser Leu Leu Pro lie 
130 135 140 

Val Gly Leu Ser Met Ala lie Leu Leu Ala Phe Trp Met Tyr Arg His 
145 150 155 160 

Arg Lys Pro Pro Tyr Gly His Val Glu lie Asn Glu Asp Pro Gly Leu 
165 170 175 

Pro Pro Pro Ser Pro Leu Val Gly Leu Lys Pro Leu Gin Leu Leu Glu 
180 185 190 

lie Lys Ala Arg Gly Arg Phe Gly Cys Val Trp Lys Ala Arg Leu Leu 
195 200 205 

Asn Glu Tyr Val Ala Val Lys He Phe Pro Val Gin Asp Lys Gin Ser 
210 215 220 

Trp Gin Cys Glu Lys Glu He Phe Thr Thr Pro Gly Met Lys His Glu 
225 230 235 240 

Asn Leu Leu Glu Phe He Ala Ala Glu Lys Arg Gly Ser Asn Leu Glu 
245 250 255 

Met Glu Leu Trp Leu He Thr Ala Phe His Asp Lys Gly Ser Leu Thr 
260 265 270 

Asp Tyr Leu Lys Gly Asn Leu Val Ser Trp Asn Glu Leu Cys His He 
275 280 285 

Thr Glu Thr Met Ala Arg Gly Leu Ala Tyr Leu His Glu Asp Val Pro 
290 295 300 
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Arg Cys Lys Gly Glu Gly His Lys Pro Ala lie Ala His Arg Asp Phe 
305 310 315 320 

Lys Ser Lys Asn Val Leu Leu Arg Asn Asp Leu Thr Ala He Leu Ala 
325 330 335 

Asp Phe Gly Leu Ala Val Arg Phe Glu Pro Gly Lys Pro Pro Gly Asp 
340 345 350 

Thr His Gly Gin Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu 
355 360 365 

Glu Gly Ala He Asn Phe Gin Arg Asp Ser Phe Leu Arg He Asp Met 
370 375 380 

Tyr Ala Met Gly Leu Val Leu Trp Glu He Val Ser Arg Cys Thr Ala 
385 390 395 400 

Ala Asp Gly Pro Val Asp Glu Tyr Leu Leu Pro Phe Glu Glu Glu He 
405 410 415 

Gly Gin His Pro Ser Leu Glu Asp Leu Gin Glu Val Val Val His Lys 
420 425 430 

Lys He Arg Pro Val Phe Lys Asp His Trp Leu Lys His Pro Gly Leu 
435 440 445 

Ala Gin Leu Cys Val Thr He Glu Glu Cys Trp Asp His Asp Ala Glu 
450 455 460 

Ala Arg Leu Ser Ala Gly Cys Val Glu Glu Arg He Ser Gin He Arg 
465 470 475 480 

Lys Ser Val Asn Gly Thr Thr Ser Asp Cys Leu Val Ser He Val Thr 
485 490 495 

Ser Val Thr Asn Val Asp Leu Pro Pro Lys Glu Ser Ser He 
500 505 510 

(2) INFORMATION FOR SEQ ID NO: 5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Leu Lys Pro Glu Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



-55- 



(ix^ FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa at position 2 is either 
"Thr" or "Ser" . M 

(ix) itfSATURE: 

i(A) NAME /KEY: Modif ied-site 
3) LOCATION: 5 

(b) OTHER INFORMATION: /note= "Xaa a position 5 is either 
"Tyr" or "Phe".*' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Gly Xaa Xaa Xaa Xaa Xaa 
1 \ 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYEE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : protein 

(v) FRAGMENT TOPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp lie Lys Ser Lys Asn 
1 5 

(2) INFORMATION FOR SEqVeD NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 ataino acids 

(B) TYPE: amino \acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION \ SEQ ID NO:8: 

Gly Thr Arg Arg Tyr Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i> SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 6 amino acic 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE; internal 



f 
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(xity SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asm Leu Ala Ala Arg Asn 

1 \ 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 
(jy) LENGTH: 6 amino acids 

TYPE: amino acid 
(DA TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(ix) FEATURE K 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHEH INFORMATION: /note= "Xaa at position 3 is either 
Le tt or M Val M . 

{ ix) FEATURE : 

(A) NAME/K&Y: Modif ied-site 

(B) LOCATION: 4 
(D) OTHER IOTORMATION: /note= "Xaa at position 4 is either 

"Lys w \or "Arg" . " 

(ix) FEATURE: 

(A) NAME/ KEY : \Modif ied-site 
<B) LOCATION: 

(D) OTHER INFORMATION: /note= "Xaa at position 6 is either 
"Thr" orX^Met". 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Xaa Pro Xaa Xaa Trp 
1 5 



Xaa\ 



(2) INFORMATION FOR SEQ ID NO:\Ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base Wirs 

(B) TYPE: nucleic acid\ 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 72.. 1553 



(xi) SEQUENCE DESCRIPTION: SEQ ID N$:ll: 
CCCCGGGAAC TTCAAAGCGC GCTGCGGCGG CGCTCTGAGA CCCCGAAGCC TTGCACCGCC 



GCGGGGTGGC C ATG ACC CCA GCG CGC CGC TCC 

Met Thr Pro Ala Arg Arg Ser A^ 
1 5 



CTG AGC CTG GCC CTC 
Leu Ser Leu Ala Leu 
10 



60 
110 



-57- 



CTG C*ft3 GTG GCA CTG GCC TCC GAC CTT GCG GCA GGA CTG AAG TGT GTG 
Leu Lem Val Ala Leu Ala Ser Asp Leu Ala Ala Gly Leu Lys Cys Val 
ia 20 25 



158 



TGT CTTVTTG TGT GAT TCC TCA AAC TTT ACC TGC CAA ACC GAA GGA GCA 
Cys Leu Leu Cys Asp Ser Ser Asn Phe Thr Cys Gin Thr Glu Gly Ala 
30 \ 35 40 45 



206 



TGC TGG G&C TCT GTC ATG CTA ACC AAC GGG AAA GAA CAG GTG AGC AAA 
Cys Trp Ala Ser Val Met Leu Thr Asn Gly Lys Glu Gin Val Ser Lys 
50 55 60 



254 



TCG TGC GTG ^ cc CTC CCG GAA CTA AAT GCT CAG GTC TTC TGT CAC AGT 
Ser Cys Val SLer Leu Pro Glu Leu Asn Ala Gin Val Phe Cys His Ser 

70 75 



302 



TCC AAC AAC GT& ACC AAG ACC GAA TGT TGC TTC ACA GAC TTC TGC AAC 
Ser Asn Asn ValVrhr Lys Thr Glu Cys Cys Phe Thr Asp Phe Cys Asn 
80 \ 85 90 



350 



AAC ATC ACT CAG C&C CTT CCC ACA GCA TCT CCA GAT GCC CCT AGA CTT 
Asn lie Thr Gin Hits Leu Pro Thr Ala Ser Pro Asp Ala Pro Arg Leu 
95 \ 100 105 



398 



GGC CCC ACA GAG CTG \ACA GTT GTT ATC ACT GTA CCT GTT TGC CTC CTG 
Gly Pro Thr Glu Leu Thr Val Val lie Thr Val Pro Val Cys Leu Leu 
110 m5 120 125 



446 



TCC ATC GCA GCC ATG Cfk ACG ATA TGG GCC TGC CAG GAC CGC CAG TGC 
Ser lie Ala Ala Met Leu\ Thr lie Trp Ala Cys Gin Asp Arg Gin Cys 
130 \ 135 140 



494 



ACA TAC AGG AAG ACC AAG 
Thr Tyr Arg Lys Thr Lys 
145 



CAC AAT GTG GAG GAA CCA CTG GCA GAG 
His Asn Val Glu Glu Pro Leu Ala Glu 
150 155 



542 



TAC AGC CTT GTC AAT GCT GGA \AAA ACC CTC AAA GAT CTG ATT TAT GAT 
Tyr Ser Leu Val Asn Ala Gly Lys Thr Leu Lys Asp Leu He Tyr Asp 
160 rfcs 170 



590 



GCC ACT GCC TCG GGC TCA GGA TC* GGC CCG CCT CTT TTG GTT CAA AGA 
Ala Thr Ala Ser Gly Ser Gly Ser\ Gly Pro Pro Leu Leu Val Gin Arg 
175 180 \ 185 



638 



ACC ATC GCA AGG ACA ATT GTA CTT 
Thr lie Ala Arg Thr He Val Leu Gl 
190 195 



GAA ATC GTA GGA AAA GGT CGG 
Glu He Val Gly Lys Gly Arg 
200 205 



686 



TTT GGG GAA GTG TGG CAC GGA AGA TGG\TGT GGA GAA GAT GTG GCT GTG 
Phe Gly Glu Val Trp His Gly Arg Trp Vys Gly Glu Asp Val Ala Val 
210 2^.5 220 



734 



AAA ATA TTC TCC TCC AGA GAT GAG AGA TCT TGG TTC CGT GAG GCA GAA 
Lys He Phe Ser Ser Arg Asp Glu Arg SerWrp Phe Arg Glu Ala Glu 
225 230 \ 235 



782 



ATT TAT CAG ACG GTA ATG CTG AGA CAT GAG 
He Tyr Gin Thr Val Met Leu Arg His Glu 
240 245 



\T ATT CTC GGT TTC ATC 
He Leu Gly Phe He 
250 



830 



GCG GCC GAC AAC AAA GAT AAT GGA ACC TGG ACT\CAG CTT TGG CTT GTG 
Ala Ala Asp Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val 
255 260 2$5 



878 



TCA GAG TAT CAC GAG CAG GGC TCC TTA TAT GAC TOflF TTG AAT AGA AAC 
Ser Glu Tyr His Glu Gin Gly Ser Leu Tyr Asp Tylr Leu Asn Arg Asn 
270 275 280 \ 285 



-58- 



ATA GT& ACC GTG GCT GGA ATG GTC AAG TTG GCG CTT TCA ATA GCG AGT 
He Va\ Thr Val Ala Gly Met Val Lys Leu Ala Leu Ser lie Ala Ser 
290 295 300 



974 



GGT CTG OCT CAC CTA CAC ATG GAG ATC GTG GGC ACT CAA GGT AAG CCT 
Gly Leu A^a His Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro 
305 310 315 



1022 



GCT ATT GCT^CAC CGA GAT ATA AAG TCA AAG AAT ATC TTA GTC AAA AAG 
Ala He Ala wis Arg Asp He Lys Ser Lys Asn He Leu Val Lys Lys 

325 330 



1070 



TGT GAC ACT TOT GCC ATA GCT GAC TTA GGG CTG GCT GTG AAA CAT GAT 
Cys Asp Thr Cyfe Ala He Ala Asp Leu Gly Leu Ala Val Lys His Asp 
335 \ 340 345 



1118 




TCT ATC ATG AAC 
Ser He Met Asn 
350 



ATA GAT ATA CCC CAG AAT CCT AAA GTG GGA ACC 
ir He Asp He Pro Gin Asn Pro Lys Val Gly Thr 
355 360 365 



AAG AGG TAT ATG GCS CCC GAA ATG CTT GAT GAT ACA ATG AAC GTC AAC 
Lys Arg Tyr Met AlaXPro Glu Met Leu Asp Asp Thr Met Asn Val Asn 
370\ 375 380 



ATC TTT GAG TCC TTC 
He Phe Glu Ser Phe 
385 



[IG CGA GCT GAC ATC TAT TCG GTG GGG CTG GTT 
Arg Ala Asp He Tyr Ser Val Gly Leu Val 
390 395 



1166 



1214 



1262 



TAC TGG GAA ATA GCT CGA yiGG TGT TCA GTT GGA GGA CTT GTT GAA GAG 
Tyr Trp Glu He Ala Arg Arg Cys Ser Val Gly Gly Leu Val Glu Glu 
400 \ 405 410 



1310 



TAC CAG TTG CCT TAT TAT GAC ATG GTG CCT TCA GAT CCT TCC ATA GAG 
Tyr Gin Leu Pro Tyr Tyr AsriMet Val Pro Ser Asp Pro Ser He Glu 
415 420\ 425 



1358 



GAA ATG AGG AAG GTC GTT TGT 
Glu Met Arg Lys Val Val Cys 
430 435 



\T CAG AAA CTG CGA CCA AAT CTC CCA 
Gin Lys Leu Arg Pro Asn Leu Pro 
440 445 



AAC CAG TGG CAA AGC TGT GAG GCG\CTC CGG GTC ATG GGA AGA ATA ATG 
Asn Gin Trp Gin Ser Cys Glu Ala >Leu Arg Val Met Gly Arg He Met 
450 \ 455 460 



1406 



1454 



CGT GAG TGC TGG TAT GCC AAC GGG 
Arg Glu Cys Trp Tyr Ala Asn Gly 

465 47* 



GCT CGC CTG ACC GCC CTG CGC 
Ala Arg Leu Thr Ala Leu Arg 
475 



1502 



GTG AAG AAG ACC ATT TCT CAG CTG TGT\GTC AAG GAA GAC TGT AAG GCC 
Val Lys Lys Thr He Ser Gin Leu Cys Val Lys Glu Asp Cys Lys Ala 
480 485 \ 490 



1550 



TAAGGATACA GGCGACGGGA AAGCCCTCAC CACTCTCTTT CATGTCTCCT GC 



1602 



• 
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(2/j\ INFORMATION FOR SEQ ID NO: 12: 

\ (i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 493 amino acids 

\ (B) TYPE: amino acid 

\ (D) TOPOLOGY: linear 

Vii) MOLECULE TYPE: protein 

(W) SEQUENCE DESCRIPTION: SEQ ID NO: 12; 

Met Thr Wo Ala Arg Arg Ser Ala Leu Ser Leu Ala Leu Leu Leu Val 
1 \ 5 10 15 

Ala Leu Ala Ser Asp Leu Ala Ala Gly Leu Lys Cys Val Cys Leu Leu 
\ 20 25 30 

Cys Asp Ser\ Ser Asn Phe Thr Cys Gin Thr Glu Gly Ala Cys Trp Ala 
35\ 40 45 

Ser Val Met Leu Thr Asn Gly Lys Glu Gin Val Ser Lys Ser Cys Val 
50 V 55 60 

Ser Leu Pro Glu Leu Asn Ala Gin Val Phe Cys His Ser Ser Asn Asn 
65 \ 70 75 80 

Val Thr Lys Thrylu Cys Cys Phe Thr Asp Phe Cys Asn Asn lie Thr 
\85 90 95 

Gin His Leu Pro Thar Ala Ser Pro Asp Ala Pro Arg Leu Gly Pro Thr 
100 \ 105 110 

Glu Leu Thr Val ValVtle Thr Val Pro Val Cys Leu Leu Ser He Ala 
115 \ 120 125 

Ala Met Leu Thr He Ttp Ala Cys Gin Asp Arg Gin Cys Thr Tyr Arg 
130 \ 135 140 

Lys Thr Lys Arg His AsnYVal Glu Glu Pro Leu Ala Glu Tyr Ser Leu 
145 150 \ 155 160 

Val Asn Ala Gly Lys Thr Lfeu Lys Asp Leu He Tyr Asp Ala Thr Ala 
165 \ 170 175 

Ser Gly Ser Gly Ser Gly ProXPro Leu Leu Val Gin Arg Thr He Ala 
180 \ 185 190 

Arg Thr He Val Leu Gin Glu lie Val Gly Lys Gly Arg Phe Gly Glu 
195 200 205 

Val Trp His Gly Arg Trp Cys Gl\ Glu Asp Val Ala Val Lys He Phe 
210 215 \ 220 

Ser Ser Arg Asp Glu Arg Ser Trp she Arg Glu Ala Glu He Tyr Gin 
225 230 \ 235 240 

Thr Val Met Leu Arg His Glu Asn He\ Leu Gly Phe lie Ala Ala Asp 
245 \250 255 

Asn Lys Asp Asn Gly Thr Trp Thr Gin ueu Trp Leu Val Ser Glu Tyr 
260 265 \ 270 

His Glu Gin Gly Ser Leu Tyr Asp Tyr Leu\Asn Arg Asn He Val Thr 
275 280 \ 285 

Val Ala Gly Met Val Lys Leu Ala Leu Ser lie Ala Ser Gly Leu Ala 
290 295 \ 300 



Hi« \ " 6 °- 
His Leu His Met Qlu ll e » , 

3os r S Val am oly hya Pro 

asd n» t 320 



Asn ti . J15 e AIa 

Asp ne Lys Ser 320 

32S y Asn z le Leu Val i v = r 

330 Lya Cye Asp Thr 

■lie Ala t\ . 335 



<*» ma n e al 330 y cys As p Tl « 

1 Ile A la Asp Leu Glv r 335 
\ 340 * 6U ^u Ala Val Lys H i, a 

\" ° l ° JS «« «r «, lys _ „ 

\ 3 75 P Thr Met Asn val Asn ti. dk 

S « Phe Lyl Ara „ 380 ^ ° lu 

385 y l A 9 Ala Asp ii e ^ e 

\ 39? ^ r Val Gly Leu 

Ala Ara L ^ 395 ^ Trp Glu 

^9 ser val oiv M 400 
\ 405 Gly G1 y Leu Val Gl., p, 

^ * Met Va! Pro ser Asp Pro Ser ^ ^ 
^ValvalcylAspGlnL " 

Sins " V - ^ P - ^ - - Asn Gin Trp 

Q ln Ser Cys Glu Wl* r 445 ^ 

450 y F a I*u Arg Val Met Gly Ara ti 

\ 455 7 g Ile Met Ara „ 

JJP Tyr Ala Asn A.. - 4 *<> Ar9 Glu Cys 



IS AU L6U Ala Leu Arg Val L 

475 g al Lys 

480 



^^SerGlnLei^ Va . 

48 5 \ CyS Val **. «» Asp cys Lys Ala 

fi\ ^ \ 490 r AXa 



«) INTO «MA-rxo NTOR L IDNO:l3: 

(i) SEQUENCE CHAii^ MT 

(B) TYPE: nu I,° * )3Se P ai « 

'«) -OX.CULE type.. L (genomicj 

SEQUENCE DESCRIpIon. SEO Tn 
CGGGATCOGT NGCNGTNAAR ATHTTOCC ^ : 13: 
(2) INFORMATION FOR S£Q T£) 1^ 



MOLECULE TYPE: UNA 



Pic) 



CGGGATCCYT CNGGNGCCAT RTANCKYCTN g>I^c NO ^ 4 = 



28 



